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THE  BAYESIAN  INVENTORY  PROBLEM 


Chapter  1.  INTRODUCTION 


1.1  Background . 

Our  objective  is  to  improve  the  management  of  repair  parte  for  newly 
fielded  weapon  systems.  The  initial  procurement  of  each  part  is  made  a  lead 
time  before  the  system  is  fielded,  and  ia  based  on  an  engineering  estimate 
of  the  mean  part  demand  rate  per  fielded  system.  There  may  be  additional 
procurements  before  the  fielding  date.  Once  the  system  ia  fielded,  demand 
experience  accrues  and  is  used  to  update  the  forecast  of  the  demand  rate, 
improving  its  accuracy. 

Inventories  are  managed  under  a  periodic  review,  (s,S)  policy: 
when  assets  fall  to  s,  order  up  to  S.  The  period  is  as  small  as  one  week. 
The  issue  of  concern  is  how  the  expected  improvement  in  accuracy  of  the  demand 
forecast  should  affect  the  values  of  the  inventory  control  parameters. 

Formally,  we  are  seeking  to  determine  optimum  (s,S)  parameters 
-  they  change  each  period  -  when  there  is  Bayesian  updating,  periodic  review 
and  a  dynamic  mean  with  demand  randomly  distributed  about  the  mean.  It  has 
long  been  known  that  this  problem  can  be  solved  by  dynamic  programming,  Cf. 
[11] .  However,  in  the  general  case  a  multi-dimensional  state  vector  is  required, 
and  the  dynamic  programming  formulation  has  not  been  pursued.  Thus  Kaplan 
and  Kruse  unsuccessfully  solicited  interest  in  designing  a  computationally 
feasible  algorithm  [  9]  .  In  fact  we  have  not  even  found  useful  qualitative 
conclusions  about  the  Bayesian  solution  in  the  general  caae. 

Instead,  the  literature  has  focused  on  the  specific  situation  in 
which  there  is  no  cost  to  order  so  that  the  purchase  price  ia  simply  (Unit 
Price)  x  (amount  ordered).  In  this  c  se  several  interesting  results  have 
been  obtained: 

a.  OptimuDi  solutions  are  well  behaved  in  that  if  "t"  is  a  sufficient 
statistic  for  the  demand  experienced  to  date,  the  optimum  policy  is  to  order 
each  period  up  to  the  optimum  S  value  for  that  period,  and  for  a  given  period 
S  is  a  non-decreasing  function  of  ’'t". 

b.  If  the  distribution  of  demand,  conditional  on  "t",  has  certain 
properties,  the  optimum  solution  may  be  found  by  solving  a  dynamic  program 
with  only  a  single  dimensional  state  vector.  For  example,  if  the  conditional 


distribution  is  Weibull: 

f(x;w)  =  w'k'Cxv)^  ^  expl-xw]^'  x  >  o 

and 

L'ne  prior  on  w  given  "t"  is  Gamma,  then  only  a  single  dimensional  state  vector 
is  required. 

These  results  were  iirst  obtained  by  Scat!  Ill, 12]  and  are  reviewed 
and  extended  by  Azoury  1 1 ] . 

Efforts  have  also  been  made  to  determine  how  Bayesian  solutions 
relate  to  non-Bayesian  solutions.  Thus,  Scarf  looked  at  how  the  Bayes  solution 
asymptotically  approached  the  non-Bayes  solution  with  increasing  experience, 
(again  with  no  cost  to  order),  and  found  that  the  Bayes  S  values  could  approach 
from  above  or  below  the  non-Bayes  S  values.  However,  Azoury  and  Miller  do 
show,  for  an  n  period  model  for  repairable  items,  that  the  Bayes  S  is  always 
less  than  or  equal  to  the  non-Bayes  S  (2).  Again,  they  assume  no  cost  to 
ord  er . 

The  purpose  of  this  research  is  to  explore  the  feasibility  of  the 
dynamic  programming  formulation  as  a  basis  for  a  computer  algorithm,  and 
to  obtain  some  insight  into  the  impact  and  significance  of  uncertainty  about 
the  demand  rate, 

1 .2  Model. 

A  difficulty  aroae  in  formulating  a  model  for  newly  fielded  weapon  sys¬ 
tems,  i.e.  for  the  "provisioning"  phase  of  repair  part  support,  in  that  no 
rigorous  model  for  managing  inventories  after  provisioning,  when  the  demand 
forecast  stops  improving,  really  exists!  Few  if  any  real  world  inventory 
systems  forecast  demand  as  the  average  of  all  the  historical  demand  ever 
received;  rather,  some  form  of  moving  average,  exponential  smoothing  or  related 
technique  is  used.  Thus,  the  forecast  cannot  even  asymptotically  approach 
the  true  mean  so  long  as  demand  variability  does  not  diminish;  yet,  there 
is  no  practical  model  we  .'re  familial  with  which  rigorously  deals  with  the 
fact  that  the  demand  forecast  is  neither  the  true  mean  nor  approaches  it. 
StudentB  at  the  University  of  North  Carolina  under  Harvey  Wagner  have  been 
interested  in  the  problem  and  Ehrhardt  has  developed  a  "power"  approximation 
which  is  more  robust  to  the  use  of  estimated  means  and  variances  than  the 
more  rigorous  computations  which  assume  the  mean  and  variance  are  known 
(51.  Some  promising  recent  work  of  Miller  (101  attacks  a  related  problem 

theoretically  -  he  requires  the  forecast,  derived  by  exponential  oinoothing, 
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to  be  the  true  mean,  but  allows  this  mean  to  change  stochastically  each  period. 

Developing  a  rigorous  post-provisioning  model  would  be  a  major 
undertaking  in  its  own  right.  As  an  alternative,  we  assume  that  at  some 
future  period,  period  "M",  updating  of  the  inventory  control  parameters  ceases. 
While  unrealistic,  thi,s  assumption  captures  the  reality  that  even  after  provi¬ 
sioning,  the  (s,S)  parameters  will  not  reflect  knowledge  of  the  true  mean. 

The  dynamic  programming  formulation  can  quite  easily  handle  non-sta- 
tionarities;  these  may  be  due  to  changes  in  cost  and  related  parameters  such 
as  an  obsolescence  rate,  which'"  is  h  igh  only  until  weapon  design  stabilizes, 
or  non-stat ionar ity  may  reflect  changes  in  demand  per  period  as  the  number 
of  fielded  weapon  systems  increases.  The  heuristic  used  to  model  expected 
costs  after  period  "M"  cannot  handle  these  dynamic  changes  and  is  therefore 
limiting.  We  will  discuss  later  how  this  might  be  overcome  under  the  "alterna¬ 
tive  formulation"  of  the  dynamic  programming  approach. 

The  standard  dynamic  programming  formulation  limits  the  choice 
of  demand  distributions  to  those  for  which  a  sufficient  statistic  exists 
for  the  mean.  Moreover,  practicality  pretty  much  limits  choice  of  the  prior 
to  the  conjugate  distribution  (Appendix  A  reviews  sufficient  statistics  and 
conjugate  distributions).  We  discuss  an  alternative  formulation  which  allows 
any  demand  distribution  but  has  these  disadvantages:  the  prior  must  be  expressed 
as  a  discrete  histogram^  and  it  is  more  time  consuming  to  run  on  a  computer. 

The  cost  structure  allowed  in  the  dynamic  programming  formulation 
is  quite  general,  but  the  heuristic  used  to  model  costs  after  period  "M" 
does  require  a  discount  rate.  For  readers  unfamiliar  with  the  discount  rate 
concept  and  its  application  to  treatment  of  interest  rates  and  obsolescence 
the  paper  on  multi-year  holding  costs  is  recommended  [6].  In  brief,  use 
of  a  discount  rate  is  crucial  to  cost  analysis  when  inventories  may  be  held 
for  many  years,  as  is  possible  when  buys  are  made  based  on  forecasts  which 
may  be  much  too  high.  If  i  is  the  interest  rate,  per  period,  and  0  is  the 
probability  of  obsolescence  in  a  period,  the  one  period  discount  is  (l-8)/(l+i), 
where  l/(l  +  i)  is  the  present  value  and  (1-0)  is  the  probability  the  item 
can  incur  demand  (it  is  not  obsolete). 


By  this  we  mean  a  discrete  distribution  with  a  limited  number  of  points  with 
positive  probability,  and  in  which  it  may  not  be  possible  to  express  the 
probabilities  other  than  by  enumeration. 
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1  •  3  OrHaniiation  of  Report. 


Both  the  "standard"  dynamic  programming  tormulat ion  based  on  a  sufficient 
statistic  and  the  alternative  formulation  based  on  a  histogram  prior  are 
presented.  Issues  in  moving  from  formulation  to  algorithm  are  discussed. 
Starting  the  dynamic  programming  recursion  is  the  most  difficult  issue  and 
receives  an  extensive  treatment. 

An  algorithm  based  on  the  standard  formulation  was  programmed  and 
applied  to  a  simple  experimental  design.  Results  are  presented  and  conclusions 
are  drawn  . 
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Chapter  2.  DYNAMIC  PROGRAMMING  FORMULATION 
2 . 1  Recursive  Equation. 

A  periodic  review  inventory  policy  is  assumed.  The  sequence  of  events 
in  a  period  is  assumed  to  be:  receive  demand,  debiting  ossets  accordingly, 
record  holding  or  backorder  costs  based  on  current  on  hand;  receive  delivery 
of  inventory  ordered  a  lead  time  (L'  ago;  order  new  inventory  if  warranted. 
While  the  assumed  sequence  of  events  within  a  period  is  arbitrary,  the  formulation 
could  be  adjusted  to  accommodate  any  other  assumed  sequence. 

Inventory  ordered  at  period  n  does  not  affect  on  hand,  and  therefore 
backorder  and  holding  costs  in  the  interval  [u,n+L],  Therefore,  in  developing 
an  inventory  policy  for  perijd  n,  we  do  not  need  to  include  those  costs  in 
our  cost  expression. 

Define  ^n(x»tn-  as  the  expected  value  of  the  present  value  of  all 
future  costs  at  period  n,  omitting  backorder  and  holding  costs  in  [n,n+L). 
"x"  are  assets  just  before  ordering,  and  demand  experience  is  summarized 
by  the  sufficient  statistic  "t".  Implicit  is  the  use  of  a  particular  inventory 
policy  for  deciding  what  to  order  in  period  n  and  thereafter;  e.g.  the  policy 
may  bp  to  minimize  costs. 

Then  , 


Cn(x*tn)  “  Mr*)  +  +  a  1  Cn  +  1 1  • f  3(  ln  >d )  1  ‘  f4(dltn) 

d 


(2.1) 


v  =  f5(x*tn) 

where : 

y:  assets  after  ordering  in  period  n. 
fj(z):  cost  of  ordering  z  units  of  inventory. 

f 2 ( y ! t n ) :  expected  value  of  the  present  value  of  backorders  and  holding 
costs  in  period  tn+L+1)  given  inventory  at  the  end  of  the  current  period 
of  y,  and  future  demand  whose  distribution  is  conditional  on  tn ,  as  well 
as  on  the  original  prior  on  the  demand  distribution. 

a:  the  discount  rate  which  accounts  for  the  time  value  of  money  and 

the  possibility  of  obsolescence, 
d:  demand  in  period  n+1. 

f  3  ( t.  n  ■  *d )  :  the  function  for  recomputing  the  sufficient  statistic  after 

one  additional  period  hac  elapsed  and  demand  of  d  has  occurred.  Typically, 
but  not  always,  tn  is  the  2  dimensional  vector  (n,  sum  of  demands  to  date) 


■  o  f  -j  ( t  n ,  d )  ■  t  +  the  vector  (l,d).  If  demand  is  uniform,  t  would  be  (n, 
max  itaum  demand  in  any  period  exper  ienced )  I  4 )  . 

fg(d|t  )•  probability  function  for  demand  given  tn 

f5(x;tn):  A  mapping  from  (x,tn)  to  y  reflecting  the  inventory  policy, 

where  y  may  equal  x.  If  the  objective  is  solely  to  minimize  costa,  then 
y  is  found  as  the  value  which  when  substituted  into  the*  RHS  (right  hand  side) 
of  equation  (2.1)  minimizes  Cn(x,tn). 

Note  that  to  handle  non-6tat  ionar  i  t  ies ,  a  subscript  of  n  would 
be  appended  to  fj,  f  2 »  <1  f  3 ,  £4  where  f^r  would  mean  ordering  cost  parameters 
are  t  line  dependent,  f2n  would  reflect  time  dependency  of  backorder,  holding 
or  discount  parameters,  and  fjn  and  would  reflect  the  need  to  incorporate 

the  number  of  systems  fielded  in  every  period.  For  the  first  L  periods  before 
any  system  is  fielded,  f  ^ ( d  1 1  n )  is  always  0  for  d  >  0. 

2.2  Use  of  Recursive  Equation  in  an  Algorithm. 

The  major  problem  is  how  to  ge  t  W’*>  f  or  some  n  in  order  to  start 
the  iterative  process  by  which  Cn(  *,*)  can  eventually  be  determined.  Period 
zero  represents  the  present,  at  'hich  a  decision  must  be  made.  As  discussed 
in  the  introduction,  our  efforts  are  devoted  to  determining  C^(*,’),  M  being 
when  updating  of  the  inventory  control  parameters  ceases.  An  entire  section 
will  be  devoted  to  this. 

The  next  problem  is  that  there  iB  no  natural  upper  bound  on  the 
demand  which  may  be  received,  so  some  kind  of  truncation  is  necessary.  Truncation 
arises  in  three  drstinct  contexts: 

a.  Truncation  on  demand  received  in  period  n  +  1  (the  d  valueB 
in  the  recursive  equation). 

b.  Truncation  on  total  demand  in  the  next  (Lei)  periods,  used 
in  getting  f2(y  |tR). 

c.  Truncation  on  the  values  of  tn.  In  particular,  where  tn  is 
the  vector  (n,D  ),  Dn  being  total  experienced  demand  in  the  first  n  periods, 
we  are  referring  to  truncation  on  possible  values  of  . 

The  single  period  (one  week)  demand  values  were  truncated  at  a 
cumulative  probability  of  1  -  10  i.e.,  the  value  k  was  found  such  that: 

k 

I  f  ,  (d  1 1  )  >  1  -  10"'* 
d-0  A  n  _ 
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and  then 


f^(k|tn)  was  reset: 


f4<k  |t_) 


1-  I  fA(djtn) 
d-0 


n'  .  „  A '  1  n 

Lead  time  demand  was  truncated  at  a  cumulative  probability  of  1  -  10 
These  truncation  values  were  subjectively  chosen. 
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Now  let  t 


max 

n 


be  the  maximum  value  of  the  sufficient  statistic 


.max 


t"  considered  in  period  n  and  let  dn+^  be  the  single  period  demand  maximum 


in  period  n+1,  based  on  f.(d|t 

A  n 

plus  the  vector  (l.tf13?  ). 

n+1  max 


max 


max 


max 


).  It  would  be  natural  to  set  tn+^  » 


However,  based  on  a  kind  of  bootstrap  mechanism 
_ x 

at  work,  as  n  increased,  t^  would  tend  to  increase  out  of  control,  with 

the  state  space,  the  values  of  (x.t^  )  to  be  evaluated,  getting  out  of  hand. 

max 

Moreover,  the  probability  of  values  as  large  as  such  a  t  actually  occurring 
on  a  real  item  would  get  increasingly  small  as  n  increased. 

Therefore,  let  PMEAl^  and  PSDEVn  be  the  mean  and  standard  deviation 
of  demand  over  n  periods  calculated  using  the  original  prior,  before  any 
updating.  Set: 


.max 

"n 


-  [n,  PMEAN  +  A  •  PSDEV  1 
n  nJ 


where  the  value  A  was  chosen  after  come  experimentation  on  the  sensitivity 

of  results  to  this  value  (see  Appendix  C).  Truncate  single  period  demand 

.  .  .  QAX  .  t 

as  necessary  so  as  not  to  violate  the  upper  bound  on  (the  truncation 

of  lead  time  demand  is  not  affected.)  We  are  excluding  values  of  t^  which 

can  occur  only  with  very  small  probability,  although  If  our  initial  knowledge 

about  the  true  mean  is  poor,  PSDEV  will  be  large  and  t™3*  will  be  large.  Also, 

n  n 

max  ,  , 

t  does  increase  with  n. 
n 

One  might  ask  how  we  limit  the  values  of  assets,  x,  to  consider. 

The  maximum  x  would  be  the  largest  order  up  to  value,  S,  for  any  iP3*  .  The 

n 

current  computer  program  makes  a  very  conservative  guess  on  what  this  will 

be,  and  issues  a  warning  if  this  guess  proves  to  be  too  low. 

The  remaining  issue  relating  to  use  of  the  recursion  equation  is 

whether  to  assume  any  particular  form  for  the  inventory  policy.  It  is  perfectly 

feasible  to  find  the  optimum  y  for  each  (x,  t  )  vector,  which  we  will  label 
*  “ 
y  (x»tn).  Such  answers,  however,  are  not  nearly  as  convenient  to  work  with 

as  the  optimum  values  for  an  (a,S)  policy  since  potentially  we  need  to  record 

it 

a  different  y  for  each  (x.t^)  whereas  the  (e,S)  parameters,  by  definition, 
are  independent  of  x. 
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Ac  intereating  question  is  just  what  can  we  say  theoretically  about 
the  y  (x,tn).  We  assume  order  coat  ia  of  the  form  Cp  +  (UP)(y-x)  and  is 
xero  for  y  •  x. 


Claim 


If  x  <  y  (o  ,t  ) 

*  ^ 
x  <  y  (x,t  ) 

*  n  * 

Then  y  (x,tn)  -  y  (o,tn) 


The  optimum  order-up-to  level  when  assets  are  0,  y  (o.t^), 
will  also  be  the  optimum  order-up-to  level  when  assets  are  x,  for  all 
x  <y  (o.tp)  at  which  it  is  worthwhile  to  place  an  order. 

Proof 

Let  Vn(x;y,tn)  be  the  value  of  Cn  (x ;  )  if  the  amount  y-x 

is  ordered  in  period  n.  For  convenience  drop  the  argument. 

By  inspection  of  the  recursion  equation,  (2.1),  ^(x^jy)  “ 

vrfx2»y)  +  (UPHxj-Xj)  f°r  values  of  x^,  x^  such  that  y  >  Xp  x^.  Hence 

[ o , y  (x)]  -  V^lx ,  y  (x)]  •  (UP)(x) 

V  [o,y  Co) 1  -  V  [x,  y  (o)]  -  (DP)(x) 
n  *  n 

Also,  by  definition  of  y  (o) 

Vnto,y*(x)J  -  Vn(o,y*(o)]  i0 
Adding  the  second  and  third  equation, 

Vnlo,y*(x)]  -  Vn(x,y*(o)]  >  (UP)(x) 

Subtracting  this  from  the  first  equation, 


Q.E.D. 


Vntx,y  (o)l  -  Vn  ( x  ,  y  (x)J  ^0 


The  methodology  for  determining  (s,S)  values  in  the  algorithm  is 


therefore: 

*  * 

a.  Find  y  (o,tn)  and  set  S  ■  y  (o,tn) 

b.  Find  the  smallest  x  such  that  y  (x,t  )  *  x  and  set 

*  n 

8  ■  x  -  1.  It  is  easy  to  check  that  y  (S,tn)  ■  S  and  thus  s  <  S. 

Since  it  has  never  been  shown  an  (s,S)  policy  is  optimal,  there  ib  no  assurance 
there  is  not  some  x  greater  than  the  o  identified  for  which  it  would  decrease 
costs  to  order . 


If  the  prior  is  expressed  in  the  form  of  a  histogram  it  is  always  possible 
to  use  dynamic  programming  as  an  evaluator  of  some  given  policy.  If,  in 


addition,  we  restrict  the  set  of  policies  to  those  which  are  based  on  the 
total  demand  experienced  to  date,  and  do  not  consider  the  period  to  period 
pattern  of  demand,  then  dynamic  programming  can  be  used  to  find  the  optimum 
policy  or  optimum  (s,S)  policy.  The  technique  does  not  depend  on  any  assumption 
about  what  the  demand  distribution  is,  but  has  computational  disadvantages 
compared  to  the  sufficient  statistic  approach. 

We  will  first  show  how  to  write  the  recursion  equation,  (2.1), 
in  a  different  way,  one  which  is  more  readily  generalised  to  our  current 
assumptions.  Let  (y;tn  )  -  note  the  prime  -  represent  expected  future 
discounted  costs  -/hen  assets  are  y  after  the  order  is  placed  in  period  n. 
From  the  definition  of  f5(x,tn)  if  assets  after  ordering  in  period  n  are 
y,  then  assets  after  ordering  in  period  n  +  1 ,  given  that  demand  of  d  occurs, 
ar  e : 

f5ly-d,  f3(tn,d)) 

Thus  , 

C'  (y;t^)  *»  ^2^Cn^+  a  ^  ^  4'^  Cn  ^Express  i°n)  (2.2) 

n  d 

Expression  “  f  ^  f  y~d  ,  f  ^  ( t  n,d  )  ]  -  (y-d)} 

-  c;+1  £3<tn.d)i) 

To  generalize  this  to  a  histogram  prior,  let  i  be  the  set  of 
possible  mean  demand  rates;  let  f^(l;tn)  be  the  probability  of  i  after  observ¬ 
ing  where  1b  the  2-dimensional  vector  (n,  total  demand);  and  let  C’(y;tn,i) 
be  the  cost  conditional  on  what  the  true  mean  demand  rate  is.  It  is  C'(y;tn,i) 
we  calculate  recursively 

C^(y;t^i)  *  £^!y;i)  +  a  l  f ^(d ; i )( Express  ion)  (2.3) 

d 

Expression  •=  f  ^{f  ^  [  y-d  ,  f  t^.d ) )  -  (y-d)} 

*  c;+l  lf5l>"d'  f3(tn-d,|i  f3(Vd)'  ‘> 

For  purposes  of  finding  (s,S)  values,  we  set: 

Cn  (y;tn)  =  E  f6<i;tn}  ^ 
i 

and  use  the  (y;t  );e.g.  we  set  S  to  y  which  minimizes  (UP)(y)  + 

Cn  (y;tn} 

The  histogram  form  of  the  recursion  has  the  obvious  computational 
disadvantage  of  requiring  values  of  C'  for  each  i.  An  important  advantage 


ia  that  the  number  of  probability  distributions  which  must  be  calculated 

depends  on  the  number  of  histogram  points  rather  than  on  the  number  of  values 

of  t  . 
n 
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Chapter  3.  STARTING  THE  RECURSION 


We  wish  to  determine  values  of  C^(x,t^)  over  the  range  of  (x,t^), 
assuming  that  the  (s,S)  values  are  not  updated  after  period  M. 

Our  approach  relies  on  being  able  to  approximate  the  priors  at 
period  M  (there  is  a  different  prior  for  each  value  of  t^  )  by  histograms. 
Other  simplifications  were  made  which  while  not  essential,  simplified  coding 
and  reduced  running  times.  Before  reviewing  these,  let  us  make  several  obser¬ 
vations  in  support  of  the  belief  that  the  details  of  the  approach  are  not 
critical. 

a.  The  primary  purpose  of  this  research  or  any  implemented  algorithm 
would  be  to  improve  the  decisions  made  before  or  around  the  time  of  fielding 
when  uncertainty  is  greatest.  Intuitively,  because  of  discounting,  the  impact 
of  the  values  C  *  ,  ')  on  the  values  Cn(*,  *) ,  M  >  n,  must  decrease  as  the 
time  Detween  M  and  n  increases. 

b.  The  decisions  made  at  period  n  do  not  depend  on  the  absolute 

values  of  Cn(*,*),  but  the  relative  values,  i.e.,  C  (0,t  )  -  C  (x,t  ),  and 
11  n’n  n  n 

therefore  depend  only  on  the  relative  costs  at  period  M.  Inspection  of  the 
recursion  equation  shows  that  the  decrease  in  cost  for  x  assets,  relative 
to  0  assets,  for  x  <  s,  is  always  (up)(x). 

c.  The  impact  of  non-optimal  (s,S)  values  at  period  M  on  the  relative 
costs  associated  with  different  asset  levels  at  period  M  is  likely  to  be 
a  second  order  effect.  The  primary  effect  would  je  to  overstate  somewhat 
the  costs  which  must  be  incurred  for  all  values  of  starting  assets,  and  even 
then  it  is  frequently  found  in  inventory  modelling  that  costa  are  flat  in 
the  region  of  optimality. 

3.1  S impl if icat ion . 

We  calculate  the  reorder  quantity  (S  minus  s)  without  regard  to  its 
impact  on  backorders,  then  calculate  s,  rather  than  jointly  optimizing  s,S. 

We  also  assume  that  when  an  order  is  placed,  assets  are  exactly  "s";  and 
we  also  assume  when  it  is  convenient  to  do  so,  that  backorder  and  holding 
co 8 1 s  are  recorded  whenever  the  asset  level  changes,  rather  than  only  at 
the  end  of  the  period.  With  a  review  period  of  only  one  week,  these  simplifi¬ 
cations  are  not  unreasonable,  but  the  assumption  that  we  can  order  at  exactly 
"8"  would  be  inadequate  for  items  characterized  by  low  frequency  of  demand, 
but  large  requisition  sizes  [7], 
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3.2 


3.3 


dotation. 

L(s,S;i): 


B(x;p,i): 
H(x;p,i) : 


*6^ i »CM^ : 
f7(p;x,i); 

G(x; i) : 


expected  lifetime  costs  including  backorder  costs  when  the 

mean  demand  rate  is  i,  an  (s,S)  policy  is  followed  and  assets 

of  s  are  initially  on  hand  and  zero  are  due-in. 

expected  backorder  and  holding  costs,  respectively 

over  the  next  p  periods  when  the  mean  demand  rate  is  i,  initial 

on  hand  is  x,  and  there  will  be  no  additional  stock  arriving 

during  the  p  periods. 

as  earlier,  probability  the  true  demand  rate  is  i;  the  tM 
subscript  will  be  dropped  in  this  section. 

probability  mass  function  on  the  number  of  periods  to  accumulate 
demand  >.  x  units,  given  a  mean  demand  rate  of  i. 

IaPf7(p;x,i)  where  a  is  the  discount  rate 


Cpi  fixed  cost  to  order 

Q:  the  order  quantity,  S  minus  s 

Cost (Q; i ) :  expected  lifetime  ordering  and  holding  costs,  but  not  backorder 
costs,  for  initial  assets  of  0,  given  i,  and  assuming  Q  is 
ordered  whenever  assets  are  zero. 

Determination  of  (3-s): 


Minimize  Z  f,(i)  Coat  (Q;i) 

Q  6 

Cost  (Q ; 1)  -  Cp  +  (up) (Q)  +  lH(Q;»,i)  -  H(Q;L,i) ] 

+  G(Q;i)  Cost  (Q;i) 

-  tCp  +  (up)  Q  +  H  (Q;°°,i)  -  H(Q;L,i)]/[l-G(Q;i)] 

In  the  expression  for  holding  cost,  H(Q;~,i)  is  total  lifetime 
holding  cost  on  the  Q  units  assuming  instantaneous  delivery,  from  which  costB 
over  the  first  L  periods,  H(Q;L,i)  are  subtracted,  since  the  Q  units  are 
still  on  order  then. 

G(Q;i)  is  the  expected  value  of  the  present  value  of  spending  one 
dollar,  given  that  the  dollar  will  be  spent  after  Q  units  are  demanded. 
If  the  item  survives  that  long,  i.e.,  it  does  not  become  obsolete  before 
then,  expected  costs  from  that  point  on  are  Cost(Q;i)  again. 
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3 .4  Determination  of  s. 

The  cost  of  a  policy  cannot  be  defined  independently  of  the  current 
asset  position.  The  8  chosen  is  the  smallest  s  such  that  it  is  as  good  or 
better  to  use  an  (s,e+Q)  policy  rather  than  an  (s+1,  s+Q+l)  policy*  given 
assets  of  s  +  1.  It  is  assumed  for  convenience  all  s  +  l  assets  are  on  hand, 
but  the  coat  difference  between  any  two  policies  is  independent  of  the  status 
of  assets  -  on  hand  or  on  order  -  since  policy  decisions  cannot  affect  backorder 
or  holding  costs  for  a  lead  time. 

We  wish  to  find  the  lowest  s  for  which  A(s)  is  >  o,  where: 

A(s)  -  l  f6(i)  |L1(i)  -  L2(i)] 

1^(1)  =  L(s+1,  s+Q+l ; i)  -  H(s+l;»,i) 

L2(l)  h  G(l;l)  [L(s,s+Q;i)  -  H(s;«,i)) 

Both  L^(l)  and  L2(i)  exclude  holding  co6t  on  the  initial  s  +  l  assets.  In 
the  case  of  L^i),  holding  cost  until  the  first  demand  -  at  which  time  assets 
fall  to  r.  -  are  never  charged,  so  only  the  remaining  costs  need  to  be  netted 
out.  The  use  of  G(l;i)  discounts  back  to  the  initial  time  wheD  assets  were 
(s  +  l)  . 

In  the  computation  of  the  Cost(Q;i)  we  saw  a  renewal  process  at 
work  in  that  if  the  item  survived,  free  of  obsolescence,  until  Q  demands 
were  incurred,  expected  costs  from  that  point  equalled  expected  costs  from 
the  original  starting  point.  We  can  take  advantage  of  the  same  logic  if 
we  exclude  holding  cost  on  the  initial  s  assets,  defining: 

L'( s , s+Q ; i)  *  L(s,s+Q;i)  -H(s;®,i) 

Ju6t  as  we  will  associate  with  each  order  of  Q  the  incremental 
holding  costs  caused  by  ordering  those  Q  unit6,  we  can  associate  the  incremental 
backorder  cost  caused  by  backorders  which  will  be  eliminated  by  these  Q  units; 
every  backorder  is  eventually  eliminated  by  a  due-in,  so  if  we  associate 
the  backorder  and  the  due-ir  which  eliminates  it,  we  will  accouDt  for  every 
backorder,  but  never  double  count. 
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Thus : 


L'(a,s+Q;i)  -  Cp  +  (up)  Q  +  H  (Q)  +  B  (Q)  +  G  (Q;i)  L'(b,8-K);1) 

where  H(q)  is  the  incremental  holding  cost  caused  by  ordering  the  Q  units 
and  B(Q)  is  a  backorder  computation  to  be  described. 

Note  first  that 

H(Q)  -  [H(s+Q;» , i)  -  H(s;®,i)]  “  [H(s+Q;L,i)  -  H(s;L,i)] 

where  the  first  term  in  brackets  is  the  added  cost  assuming  instantaneous 
delivery,  and  the  second  term  corrects  for  the  lead  time. 

Also : 

B(Q)  -  B(a;L,i)  -  B(a+Q;L,i) 

We  are  netting  out  those  backorders,  the  B(s+0;L,i),  which  would  occur  even 
if  all  Q  units  were  delivered  instantaneously.  These  backorders  will  be 
eliminated  by  some  subsequent  orders  and  will  be  associated  with  them. 

Summer izing 

L(s,3+Q; i)  -  L'  (3,s+Q;i)  4-  H(s;«,i) 

L'(s,s+Q;i)  -  [Cp  +  (up)  Q  +  H  (Q)  +  B  (Q))/[l  -  G(Q;i) ] 

H(Q)  *  [H(B+Q;«ti)  -  H(s;»,i)]  -  [H(a+Q;L,i)  -  H(s;l,i)] 

B(Q)  -  B(a;L,i)  -  B(a+Q;L,i) 

3.5  Computation  of  CH(x.tH). 

Given  the  form  of  the  recursion  equation,  (2.1),  all  backorder  and  holding 
costs  in  [M,  M+L]  are  accounted  for  by  use  of  the  £2^ylCn^  *n  Per’-0C*8  ^M-L) 
thru  (M-l).  Therefore,  the  C^(x,t^)  represent  the  expected  value  of  all 

costa  after  period  M,  net  of  backorder  and  holding  costs  ir.  [M+l,  M+l]  . 
The  computation  makes  uses  of  the  L(s,S  ;i)  with  appropriate  netting  out. 
It  is  assumed  the  x  assets  referenced  by  C  w(x,t_,)  are  all  on  hand,  but  this 
is  for  convenience  only  since  it  is  only  holding  and  backorder  costs  in  [M,M+Ll 
which  are  affected  by  the  status  of  the  x  units,  and  the  final  answer  doe6 
not  reflect  these  costs. 


18 


Claim . 


C^s;^)  =  Cm(s;lm)  +  (up)  (s-x)  for  x  <  s  (3.5a) 

CM(s;t^)  -  T  f6(i;tM)  <L(s.S;i)  -  B(s;L,i)  -  H(s;L,i)}  (3.5b) 

(^(x;^)  l  f6(i;tM)  [Tj  CD  +  T2  (i)  ]  for  x  >  s  (3.5c) 

T^i)  *=  C(x-s;  i)  (L(s,S;l)  -  H(3;«,i)l 
T2(i)  =  H(x;°°,  i)  -  H(x;L,i)  -  B(x;L,i) 

Explanation .  The  equations  follow  almost  entirely  from  the  same 
arguments  we  have  been  making  throughout  this  and  earlier  sections.  The 
logic  underlying  (a)  was  used  in  the  proof  of  the  lemma  on  the  properties 

of  the  optimum  inventory  control  parameters.  Equation  (b)  is  a  special  case 
of  ( c  ) . 

Tj  (i)  represents  the  expected  value  of  the  present  value  of  all 
future  coEts,  net  of  all  holding  costs  on  the  initial  x  units.  The  approach 
to  holding  cost  is  supported  by  the  same  logic  underlying  the  derivation 

of  L2(ij  in  the  6ub-section  on  determination  ot  6.  Since  the  x  units  are 
initially  on  hand,  no  backorders  can  occur  in  the  time  interval  until  assets 
fall  to  s,  so  l^(i)  includes  all  backorder  costs.  L(s,S;i)  discounts  them 

back  to  the  time  when  assets  first  fall  to  s,  and  G(x-s)i)  discounts  them 

further  back  to  period  M. 

T?(i)  adds  back  in  the  holding  cost  on  the  initial  assets  which 
is  incurred  after  period  M+L,  and  subtracts  out  backorders  in  (M,M+L). 

3 . 6  More  General  Formulations . 

Suppose  that  updating  of  tht  demand  forecast  ceases  after  period 
M,  but  non-stat  lonar  i  r.y  persists  m  that  additional  weapon  systems  are  fielded. 

We  may  identify  a  Deriod  M2  ,  M2  >  M,  after  which  we  are  willing 
to  assume  conditions  stabilize.  If  we  choose  large  enough,  it  will  make 

little  difference  what  we  assume,  because  discounting  will  minimize  the  impact 
on  our  current  decisions,  and  w?  may  use  a  very  crude  scheme  for  evaluating 
the  worth  of  assets  at  M9  n dative  to  0  assets. 
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Under  the  histogram  approach  for  per  iods  M2  >_  n  M,  referring  to 
equation  (2.3), 

+  a  ^  f^(d;i)  (Expression) 

d 

Expression  -  f x [f5 (y-d, t^)  -  (y-d) ]  +  c^+1 [f5(y-d,tM) ;tM,i] 

Note  that  in  accordance  with  our  assumption  the  sufficient  statistic  is  not 

updated.  Because  of  this  it  is  more  feasible  to  extend  the  dynamic  programming 

to  the  additional  interval  lM,M~).  If  tyi  must  Kt>  updated,  will  continue 

•4  1  n 

to  grow  as  n  increases  past  M  (recall  discussions  on  truncation)  and  the 
number  of  states  which  must  be  evaluated  will  continue  to  grow,  although 
interpolation  may  be  possible  (see  Appendix  C). 

Under  the  sufficient  statistic  approach  (equation  2.2),  we  must 

wr ite : 

^n  ^’Sl’Si^  "  +  a  E  f^(d|  t  )  (Expression) 

d 

Expression  -  f1[fs(y-d,tM)  -  y-d]  4-  C^+1  [f^y-d,^)  jt^tJ 

In  particular,  the  sufficient  statistic  must  be  updated  since  it  determines 
the  distribution  of  future  demands.  If  we  set  f  ^  ( d  |  tn  )  =  f^(d|t^)  for  all 
n  >  M ,  we  are  ignoring  the  correlation  between  demand  in  successive  periods, 
i.e.,  if  demand  in  M+l  is  large,  this  make6  it  more  likely  demand  in  M+2 
will  be  large  since  it  is  more  likely  the  true  mean  is  higher  than  the  mean 
implied  by  f^(d|t^). 
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Chapter  4.  MODEL  RESULTS 


Inputs  and  Assumptions 


The  model  was  run  for  a  number  of  different  examples  to  get  a  feel  for 
how  uncertainty  affects  the  optimum  (s,S)  values,  and  what  the  cost  implications 
of  ignoring  uncertainty  are. 

All  examples  were  set  in  the  provisioning  context  in  that  there 
was  no  demand  for  the  first  lead  time  before  fielding.  To  simplify  the  inter¬ 
pretation,  it  was  assumed  that  full  deployment  occurred  at  the  fielding  date, 
so  that  once  demand  began  it  occurred  at  a  constant  mean  rate  per  period. 
The  prior  on  the  mean  was  assumed  to  be  Gamma,  and  the  distribution  of  demand 
was  assumed  to  be  Poisson.  The  mathematics  for  these  specific  distributions, 
including  how  to  update  the  prior  when  deployment  is  increasing,  is  given 
in  Appendix  B. 

Other  assumptions  were  that  the  review  period  was  one  week  and 

that  updating  of  the  (s,S)  values  ceased  after  one  year.  In  all  cases  the 

cost  to  procure  was  $450  and  the  discount  rate  was  80  percent  for  one  year 
1/52 

and  therefore  .80  for  one  week.  No  holding  cost  was  charged  since  estimates 
for  US  Army  wholesale  inventories  are  only  31,  and  ignoring  holding  costs 
simplified  the  coding  of  the  algorithm.  The  discount  rate  reflects  interest 
and  obsolescence  costs  which  are  significant. 

A  simple  experimental  design  was  used.  A  base  case  was  defined 


as : 

Procurement  Lead  Time  in  Weeks  (L)  52. 

Unit  Price  (UP):  $1400. 

Average  Yearly  Demand,  i.e.  Mean  of  the  Prior  (AYD)  10. 

Each  of  these  variables  was  then  varied,  one  at  a  time,  to  two  other  settings: 

L  of  26  or  78,  UP  of  $350  or  $5600,  AYD  of  5  or  20.  For  example,  one  case 

run  was  for  L  =  26 ,  UP  =  1400,  AYD  =  10. 

Each  case  was  run  for  three  uncertainty  levels  as  measured  by  the 

coefficient  of  variation  (y):  the  ratio  of  the  standard  deviation  of  the 

prior  to  its  mean.  This  ratio  is  independent  of  the  period  of  time  to  which 

the  prior  is  applied,  whether  it  be  the  prior  on  mean  demand  for  one  week 

-6 

or  one  year  (Appendix  B) .  Uncertainty  settings  used  were  10  (essentially 
no  uncertainty),  0.5  and  1.0. 

Each  case  was  also  run  for  at  least  two  different  levels  of  backorder 
For  steady  state  models  without  discounting,  for  which  the  holding 

21 

/ 


cost . 


cost  includes  interest  and  obsolescence  costs,  there  is  a  relationship  between 
the  degree  of  protection  afforded  by  the  optimum  reorder  point  and  the  ratio 
of  backorder  cost  to  holding  costs,  so  that  it  is  more  informative  to  report 
this  ratio  than  the  specific  value  of  the  backorder  co6t.  In  our  model  holding 
cost  in  a  sense  is  the  complement  of  the  discount  rate,  i.e.,  20  percent, 
and  all  backorder  costs  are  discounted  back  to  the  period  for  which  the  (s,S) 
values  are  being  determined.  This  motivated  us  to  define  a  backorder  ratio 
(BR)  and  report  this  value,  where  for  a  discount  of  80  percent, 

8L/52 

[Backorder  Cost]  =  ( BR)(UP)( .20) 

The  base  case  was  run  for  BR  values  of  4,  10,  100,  500  and  all  other  cases 
were  run  with  BR  values  of  10  and  100. 

4.2  F ind ings . 

Table  1  shows  the  optimum  (s,Q)  values  at  time  of  fielding  where  Q  is 
defined  as  S  minus  s.  Thus,  for  the  case  where  lead  time  differs  from  the 
base  case  and  equals  78,  BR  =  100  and  Y  =  1/2, the  optimum  reorder  point  (s) 
is  26  and  the  optimum  reorder  quantity  (Q)  is  7. 

Let  us  first  say  something  about  the  results  for  when  there  is 
no  uncertainty,  Y  =  0.  There  is  some  tendency  for  Q  values  to  decrease  as 
BR  increases;  while  not  very  intuitive,  this  is  characteristic  of  stationary, 
undiscounted  models  with  certainty  [si*  Otherwise  Q's  behavior  is  intuitive, 
and  so  is  that  of  s,  which  increases  with  BR ,  L  and  UP,  the  last  because 
higher  UP  results  in  lower  Q's. 

Uncertainty  had  relatively  little  impact  on  Q  for  Y  =  0.5;  and 
when  Q  values  were  lower,  s  values,  with  one  exception,  were  higher  (the 
exception  was  for  AYD  of  5  and  BR  of  10).  For  Y  =  1,  Q  declined  in  all 
cases,  but  showed  less  sensitivity  to  BR  than  under  certainty. 

The  impact  of  uncertainty  on  s  depended  on  BR  and  y  .  For  lower 
values  of  BR ,  s  decreased,  while  for  higher  values  s  increased  (for  Y  =  .5) 
or  the  discrepancy  between  certain  and  uncertain  values  narrowed  (for  Y  - 
1.0).  For  a  very  high  BR  of  500,  s  increased  significantly  even  for  Y  = 
1. 

Intuitively,  uncertainty  suggests  lower  values  of  s  in  order  to 
hedge  until  more  is  known,  but  at  the  same  time  with  uncertainty  higher  s 
values  are  generally  needed  to  get  the  same  protection.  Backup  Table  1  offers 
insight  into  how  this  tradeoff  is  apparently  working  to  determine  the  s  values. 
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Reported  for  each  case  are  the  probability  that  demand  in  the  firBt  (L+l) 
weeks  will  exceed  the  s  value  found  to  be  optimal  at  time  of  fielding,  with 
the  probability  accounting  for  uncertainty  as  to  the  true  mean  as  well  as 
variance  of  demand  around  the  mean.  With  two  minor  exceptions  (L:26,  BR:100, 
Y  -  .5  and  AYD:5,  BR:10,  Y  =  .5),  less  protection  was  provided  by  the  optimum 
s  as  uncertainty  increased,  even  in  cases  where  the  optimum  s  upon  which 
the  calculations  were  based  was  higher  for  uncertainty  than  for  certainty. 

Table  2  shows  the  pre-fielding  pattern  of  buys  for  each  case. 
For  example,  for  the  case  (L:78,  BR:10,  Y:0)  the  first  buy  was  for  6,  a  second 
buy  was  made  at  period  (-49),  or  49  weeks  before  fielding,  increasing  on-order 
to  14,  and  a  third  buy  at  period  (-13)  increased  on  order  to  22.  The  first 
buy  is  always  made  L  weeks  before  fielding. 

In  every  ca*se  for  which  BR  was  100,  the  optimum  amount  on  order 
increased  monotonically  as  a  function  of  Y,  for  all  pre-fielding  periods. 
For  example,  comparing  y  -  0  and  Y  =  1.0,  for  the  base  case,  the  optimum 
initial  order  is  12  versus  9,  and  on-order  is  increased  to  29  at  (-34)  versus 
to  only  17  at  (-25);  (-25)  is  9  periods  after  (-34).  Results  are  more  mixed 
at  BR  =  10,  but  clearly  there  is  a  greater  tendency  to  react  to  uncertainty 
by  increasing  buys  before  fielding  than  there  is  after  fielding. 

The  explanation  is  that  hedging  -  buying  less  and  waiting  for  more 
information  -  is  a  more  viable  strategy  after  fielding  than  pre-fielding. 
After  fielding  new  information  becomes  available  each  week,  while  during 
pre-fielding  nothing  is  learned  until  demand  begins.  To  support  this  theory 
we  show  below  the  optimum  s  values  from  period  (-52)  thru  period  (-1)  for 
the  case  (Base,  BR:100,  y  =  1.0)  and  we  observe  that  the  largest  s  value 
of  25  actually  occurs  before  fielding,  and  then  the  optimum  s  start  to  decline 
as  hedging  becomes  more  viable: 

Periods  (-52)  thru  (-24):  optimum  s  increase  from  0  to  25 

Periods  (-24)  thru  (-14):  optimum  s  stays  at  25 

Periods  (-14)  thru  (-1):  optimum  s  declines  from  25  to  17 

Incidentally,  the  pre-fielding  buys  for  this  case  were  inferred  as  follows: 

at  (-52)  optimum  S  was  12;  s  first  increased  to  12  at  period  (-34),  requiring 
another  buy  then. 

As  further  corroboration  of  the  importance  of  hedging  a  special 
run  was  made  of  the  same  (Base,  BR:100,  Y:1.0)  case  we  have  been  examining. 
In  this  run  (s,S)  values  were  fixed  at  (15,20)  for  periods  1  thru  13.  In 
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the  original  run  for  this  case  (15,20)  was  optimum  for  period  0  and  subsequent 
(b,S)  value6  depended  on  the  demand  experienced.  Now,  without  the  flexibility 
to  revise  (e,S)  for  periods  1  thru  13,  the  optimum  (s,S)  values  found  for 
period  0  were  (28,32). 

We  have  observed  that  the  impact  of  uncertainty  on  the  optimum 
(b,S)  values  depends  on  a  number  of  factors  including  the  backorder  cost, 
amount  of  uncertainty,  time  until  fielding.  It  would  be  difficult  to  imagine 
a  simple  heuristic  which  takes  a'l  these  factors  into  account,  so  it  is  of 
some  interest  to  determine  the  cost  of  using  a  heuristic  which  ignores  the 
impact  of  uncertainty.  Such  a  policy  is  described  in  Appendix  D. 

Table  3  shows  the  cost  increase  when  this  heuristic  is  used  as 
the  pre-fielding  policy.  Cost  increases  are  expressed  as  a  percent  of  the 
total  expected  lifetime  cost  under  an  optimum  policy.  Since  the  heuristic 
is  used  only  through  period  0,  but  the  costs  reported  are  lifetime,  the  costs 
increases  reported  tend  to  minimize  the  adverse  impact  of  the  heuristic. 
Nevertheless  it  is  clear  that  ignoring  uncertainty  may  be  feasible  for  lower 
values  of  backorder  coBt  but  is  costly  for  higher  values, 

4.3  Summary. 

The  demand  forecasting  techniques  used  for  real  world  inventory  problems 
suggest  that  the  mean  demand  is  not  known  with  certainty,  and  in  fact  is 
subject  to  change.  This  paper  addresses  uncertainty  without  ever  really 
coming  to  gr  ip6  with  the  possibility  of  mean  changes.  Nevertheless,  the 
results  should  contribute  to  the  formulation  of  improved  policies  for  the 
early  stages  of  an  item's  life,  when  uncertainty  diminishes  as  the  basis 
of  forecast  changes  from  pre-fielding  estimates  to  actual  demand  experience. 

Two  variants  of  a  dynamic  programming  approach  of  the  problem  were 
considered;  one  requires  the  existence  of  a  sufficient  statistic  for  the 
mean,  and  the  other  requires  that  the  prior  be  specified  as  a  histogram.  The 
sufficient  statistic  approach  was  explored  in  depth  and  results  were  obtained 
which  provide  insight  into  how  uncertainty  and  learning  impact  on  the  optimum 
inventory  control  parameters. 

Even  from  the  limited  range  of  cases  examined,  it  emerges  that 
the  question  of  whether  the  proper  response  to  uncertainty  is  to  raise  or 
lower  inventory  levels  has  no  simple  answer.  Once  learning  began,  it  was 
found  to  be  worthwhile  to  tolerate  higher  probabilities  of  stockout  in  the 

lead  time,  but  this  does  net  always  translate  into  lower  "s"  values.  For 
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higher  backorders  cost  parameters,  it  became  cost  effective  to  raise  s  to 
part ial lv  offset  the  greater  stockout  risk  created  by  uncertainty.  Optimum 
reorder  quantities  were  insensitive  to  uncertainty  at  the  lower  uncertainty 
level  examined,  but  did  drop  at  the  higher  level. 

During  pre-field ing  there  is  uncertainty,  but  no  immediate  learning, 
and  the  proper  response  was  more  likely  to  be  to  raise  inventory  levels, 

although  this  also  depended  on  the  level  of  the  backorder  cost  parameters, 
and  how  close  to  fielding  the  item  was.  A  heuristic  based  on  ignoring  uncertainty 
performed  satisfactorily  at  lower  backorder  cost  parameters,  but  under-bought 
seriously  for  higher  settings. 

Computer  processing  times  would  permit  implementation  of  the  algorithm 
developed,  at  least  for  selected  items  when  high  speed  computers  were  used 
for  inventory  control.  The  current  algorithm  is  somewhat  limiting  in  its 
assumptions,  and  we  discussed  why  a  more  general  model  may  be  more  feasibly 
developed  under  the  alternative  dynamic  programming  formulation  based  on 

a  histogram  prior.  Our  belief,  however,  is  that  the  most  promising  path 
for  the  future  is  to  develop  heuristics  based  on  the  dynamic  programming 

results  and  validated  by  sensitivity  testing.  Hopefully,  such  heuristics 
would  be  less  sensitive  to  correct  assumptions  about  the  exact  distribution 
of  demand,  the  exact  distribution  of  the  prior,  and  the  exact  dynamics  of 

changes  in  the  mean  than  a  more  exact  model.  The  success  of  the  Power  Approxi¬ 
mation  provides  some  precedent  for  such  a  hope  [5). 


TABLE  1.  OPTIMUM  (a,Q)  AT  FIELDING 


BR 

Y  -  0 

CM 

H 

1 

Y  -  1 

BASES 

4 

8/8 

5/8 

0/5 

10 

11/7 

9/8 

3/5 

100 

15/7 

18/6 

15/5 

500 

18/6 

24/5 

26/5 

L-26: 

10 

5/7 

4/7 

1/4 

100 

8/7 

10/5 

8/4 

L-78: 

10 

16/8 

14/8 

4/6 

100 

22/7 

26/7 

22/6 

UP-350; 

10 

9/13 

7/13 

1/10 

100 

14/12 

16/11 

12/10 

UP-5600; 

10 

12/4 

11/4 

4/3 

100 

16/4 

19/3 

17/2 

AYD-5: 

10 

5/6 

5/5 

1/4 

100 

9/5 

10/4 

9/4 

AYD-30; 

10 

22/10 

18/10 

4/8 

100 

28/9 

33/9 

25/7 

26 


TABLE  1  BACKUP.  PROBABILITY  OF  STOCKOUT 


BR 

Y  -  0 

Y  -  1/2 

Y  -  1 

BASE: 

4 

68.8% 

77.0% 

91.1% 

10 

32.5% 

47.1  % 

68.8% 

100 

5.6% 

9.5% 

22.4% 

500 

0.9  % 

2.5% 

7.3% 

L-26: 

10 

41.8% 

51.2% 

70.3% 

100 

8.1% 

7.8% 

20 . 5% 

L-78: 

10 

35.4% 

46.5% 

72.7  % 

100 

3.7% 

10.2% 

23.1% 

UP-350: 

10 

56.6% 

62.3  % 

82.9% 

100 

9.4% 

14.3% 

29.6% 

UP-5600: 

10 

22.7% 

35.2% 

62.6% 

100 

3.1% 

7.7% 

18.6% 

AYD-5 : 

10 

40 . 1% 

38.5% 

69.9% 

100 

3.5% 

7.3% 

16.7% 

AYD-30 : 

10 

31.0% 

50.2% 

78.7  % 

100 

4.2% 

12.2% 

28.8% 

2  7 
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TABLE  2.  OPTIMUM  PRE-FIELDING  ASSETS 


BR  y  -  0  Y  =  1/2  Y  «  1 


100 

9;  17 (-25) ; 

11;  24 (-27); 

12; 

29 (-34); 

500 

j  9;  16 (-29) ; 

23(-7) j 10;  20(-36) ; 30 (—18) ; 

12; 

26 (—41) ; 

42 (-28) 

L-26: 

10 

7; 

• 

8; 

6; 

100 

7;  14 (-6); 

) 

i 

i 

7;  14 (-12); 

14; 

L-78 ! 

10 

6;  14(-49) ; 

2  2 (—13) 

8;  21(-42) 

7; 

19 (-53); 

100 

9;  17 (-51); 

25 (—203 

i 

10;21(-56) ; 34 (—31) ; 

li; 

26(-62) ; 

43 (-43) 

UP- 350s 

10 

12; 

i 

1 

14; 

12 

t 

100 

16; 

t 

l 

23; 

28 

• 

> 

UP- 5 600: 

10 

4;  9 (-34); 

14 (-13)  ' 

4;  9 (—37 )  ;  15 (-20) ; 

4; 

12 (—37 ) ; 

100 

5 ; 9 (—41) ;13(-28) ; 

18 (-12) 

6;  11(-41);  17 (-30) 
24(-17) 

7; 

16 (—43) ; 

28 (-31) 

AYD-5: 

10 

4;  9 (-16); 

j 

4;  9 (-21) ; 

6; 

100 

6;  12 (-20); 

7;  14 (-23) ; 

7; 

16(-33) ; 

AYD-30: 

10 

;  8;  18 (-32); 

29 (-9)  1 

11;  27 (-28) 

!  8; 

i 

23(-37) 

100 

11;  2K-34) 

;32(*14) 

13;  27 (-33) ;  43(-21) 

!  14 

1 

;32(-42) ; 54 (—30) ; 

28 


TABLE  3. 


INCREASE  IN  COST 


Y  - 


BR 


Y  ■  0 


Y  -  1/2 
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APPENDIX  A 

SUFFICIENT  STATISTICS 


A  convenient  reference  for  most  of  this  material  is  DeGroot  {41. 

Suppose  that  we  are  sampling  from  a  random  variable  X  with  probability 
function  (or  probability  density  function  if  X  is  continuous)  f(X;w),  where 
v  is  an  unknown  parameter.  All  the  sample  values  we  see  are  governed  by 
the  same  w,  but  w  is  the  realization  of  a  random  variable  W,  with  distribution 
g(W),  termed  the  prior. 

'V 

Let  x  be  the  sample  values,  where  n.  is  used  to  denote  a  vector. 

In  accordance  with  the  Bayesian  approach  to  statistics,  it  is  meaningful 

to  reference  the  updated  prior  g(Vfjx),  which  reflects  the  changed  probabilities 

a, 

of  what  W  might  be  after  taking  into  account  the  observed  x,  as  well  as  the 
original  prior  g(W). 

A  statistic  T(x)  is  any  function  defined  over  all  possible  values 
of  x .  A  sufficient  statistic  is  defined  by  the  fact  that  g(w|T(x))  “  g(W)|  x) 
for  all  possible  values  of  x,  where  the  equality  must  pertain  only  to  values 
of  W  with  positive  probability.  There  is  always  at  least  one  sufficient 

statistic,  namely  T(x)  =  x,  but  we  are  interested  in  sufficient  statistics 

'V 

of  fixed  dimensionality;  i.e.,  the  dimensionality  of  T(x)  does  not  increase 

% 

as  sample  size  increases.  An  example  of  T(x)  is  the  2-dimensional  vector 
(n,E  where  n  is  the  sample  size  end  x^  the  sample  values. 

In  the  definition  of  a  sufficient  statistic  the  function  g  denotes 
a  mapping  from  the  sample  space  to  a  probability  space.  Now  suppose  g  can 

*\j 

be  written  as  a  mathematical  formula  with  T(x)  and  the  sample  size  as  parameters; 

/v  'v 

also,  suppose  that  if  T(x,y)  is  the  sufficient  statistic  after  observing 

/V  'V  ...  ... 

a  total  sample  (x,y)  where  x  is  n-d imens ional  and  y  is  m-d imens lona 1 ,  then 
g(w|T(x,y),  n+m)  *  K  g  (w|t(x),  n)  g  (w|T(y),  m) 

where  K  is  a  normalizing  constant  chosen  so  that  the  right  hand  side  integrated 
(or  summed)  over  W  equals  one.  The  function  g  is  then  called  a  conjugate 
prior  to  the  distribution  f(X;w). 

It  turns  out  that  whenever  there  is  a  fixed  dimensional  sufficient 
statistic,  there  is  a  conjugate  prior.  In  Appendix  B  we  treat  explicitly 
the  Poisson  conditional,  f(X;w),  and  the  Gamma  conjugate. 

f(X;v)  is  said  to  belong  to  the  exponential  family  if  for  any  possible 
values  of  X  ,w,  ^ 

f(x;w)  =  a(w)  b(x)  exp  g±M  h^x) 
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For  any  member  of  the  exponential  family  there  is  a  sufficient  statistic 
of  dimension  k. 

However,  while  the  exponential  family  ia  very  rich  in  that  it  encom¬ 
passes  many  distributions,  it  is  possible  for  the  same  distribution  both 
to  be  and  not  to  be  part  of  the  exponential  family!  Let  us  illustrate  with 
the  Negative  Binomial  which  is  much  used  in  Army  inventory  models. 

The  Negative  Binomial  may  be  written: 

f(xsr-,,)  ■  rWCx+l)  "  f^j^/expUogU-p)*) 

Now  if  r  is  known,  and  p  is  the  unknown  parameter,  the  Negative  Binomial 
satisfies  the  condition  for  membership  in  the  exponential  family.  However, 
in  an  inventory  context  this  would  be  an  unlikely  situation  since  r  equals 
the  square  of  the  mean  divided  by  the  difference  between  the  variance  and 
mean.  If,  as  is  more  likely,  the  p  parameter  is  known,  p  being  the  reciprocal 
of  the  variance  to  mean  ratio,  and  r  is  unknown,  f(X;r)  does  not  satisfy 
the  condition  for  membership  in  the  exponential  family  since  r(r+x)  cannot 
be  written  as  expP  g,  (r)k  (x)  ] . 
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APPENDIX  b 

MATHEMATICS  FOR  GAMMA-POISSON 


Bayesian  Updating 

A  convenient  reference  for  most  of  this  material  is  Brown  and  Rogers 


[3]  . 


Write  the  Poisson,  Gamma  and  Negative  Binomial  distributions  as: 


Poisson:  f(d;x) 


xV* 

d! 


Gamma : 


gU;a,b)  -  X3  1  e_b* 


Negative  Binomial:  h[d;a,b/(b+l)  ]  =  p(d)t('a-l)  ^b+T‘)  (b+T^ 


Then  the  following  are  the  means  and  variances: 

Mean 

Poisson  X 

Gamma  a/b 

Negative  Binomial  a/b 


Var iance 

X 

o 

a/b“ 

a/b^  +  a/b 


Note  the  variance  of  the  Negative  Binomial  is  the  variance  of  a  Poisson  with 

2 

mean  a/b  plus  uncertainty  a/b  around  the  mean. 

If  demand  in  a  period  is  Poisson,  with  unknown  X  sampled  from  a 
Gamma  prior,  the  distribution  of  demand  allowing  for  uncertainty  about  X 
is  Negative  Binomial: 

h[d; a ,b/ (b+1) }  =  g(X;a,b)  f(d;X) 

If  after  n  periods  demands  of  d,,  d  ...,d  are  observed,  then  (nj  d  )  is 

1  l  n  i 

a  sufficient  statistic  and  the  updated  prior  is  a  Gamma  with  revised  parameters 


a  -  a,. 


+  Idi,  b0  =  b+n;  the  distribution  of  demand  in  one  period. 


given  (n,  Ed  is  Negative  Binomial  with  tneai  a/b  and  variance  a/h  +  a/b. 

The  distribution  over  p  periods  is  Negative  Binomial  with  mean  ap/b  and  variance 

2  2 

ap  lb  +  ap/b. 

We  can  redefine  X  as  the  expected  demand  per  weapon  system  per 
period.  Then  if  are  the  number  of  systems  deployed  in  period  i  we  could 
use  tne  above  results  with  these  substitutions: 
n 

i  W  is  substituted  for  n 
i=l  1 

n+p 

l  W  is  substituted  for  r 

i=n+l  1 

3b 


Computation  of  Functional  Values 

Let  Bc  be  backorder  cost  per  backorder  per  period  and  Hc  be  holding 
cost.  Let  f2  be  as  defined  in  Section  2.1.  Capital  letter  functions  ere 
from  Chapter  3  and  the  f,  g  and  h  functions  are  as  defined  in  this  Appendix. 

y 

f„(y;t  )  =  a  H  Z  h(d|t  )  +  a  E  Z  h(d!t  ) (d-y) 

2  n  c  n  n 

d»o  d=y+l 

<B  OO  y 

^  +1  h(dltn)(y-d)  =  Z  h(d!tn)(y-d)  -  1  h(d|tn)(y~d) 

^  d”o  d°o 

=  y  -  Expected  Value  (d I t  )  -  Z  h(d!t  ) (y-d) 

1  d=o 


B (x; P , i)  =  B  Z  l  aP  f (d ; i  )  (d-x) 
C  p-1  d-x+1  P 

P  X 

H (x;P ,  i)  -  H  Z  T.  aPf(d;i  )(x-d) 

C  p-1  d=o  P 


i  =  (i) (p) 


For  G(x;i)  and  H(x;“,i)  we  assume  costs  are  recorded  continuously 
-«t  -a 

with  a  discount  of  e  where  t  is  the  elapsed  time  in  periods  and  e  = 

-op  -u.p  p 

a.  Thus,  for  p  periods,  the  discount  is  e  =  (e  /  =  a  . 

Now  if  demand  is  Poisson  with  mean  of  i  per  period,  the  time  to 

2  2 

order  x  units  is  Gamma  with  mean  x/ i  and  variance  x/ i  from  which  we  may 

3 

infer  that  the  Gamma  parameters  are  a  =  x  and  b  =  i.  Therefore, 

G (x;  i)  «  !  e  Ut  g(t;x,  i)dt  =  /  -pyTX  t*  1  e  ^  dt 

i  X  7  (a+ilx  x-1  -(a+i)t  i  X 

W  D  Too  c  e 

since  the  last  integral  is  the  integral  of  a  Gamma  density  over  its  range. 

Now  suppose  we  numbered  (conceptually)  x  assets  from  1  to  x  with 


*This  well  known  result  follows  from  the  :act  that  Poisson  demand  implies 
exponential  time  between  demand,  and  the  sum  of  independent  exponentials 
is  Gamma. 

"The  use  of  the  Gamma  as  shown  here  was  suggested  by  Karl  Kruse  [9) 


,  .  th 

asset  1  issued  first  and  so  on.  Suppose  the  k  asset  is  issued  at  time 

t  h 

t.  Total  holding  costs  for  the  k  asset  would  be: 

H  /  dj  -  ^  [1  - 

CO  J  a 

Since  t  is  a  random  variable  with  distribution  g(t;k,i), 

H  x  «■ 

It(x;%i)  =—■  l  f  g(t;k,i)  [1  -  e  °  ] 

•  a  k=l  o 

H  x  4  k 

=  !c  I  U  -  (^)  ] 

a  k=l 

H  ,  w  x 

=  f  U  -  (r>^  r  -  l/<«+l) 

A  continuous  analogue  of  B( Q )  was  used  in  determining  the  optimum 
"s"  and  developing  I.(s,s+Q;i).  Recall 

B(Q)  *^B(s;L,i)  -  B(s+Q;L,i) 


Consider  each  of  the  Q  units  separately  and  index  them  by  k.  Let  t  be  the 
th  K  . 

time  until  the  k  of  the  Q  assets  is  demanded  measured  from  the  time  the 

th 

order  is  placed.  For  the  k  of  the  Q  assets  to  be  demanded,  there  must 
be  (s+k)  demands,  so  the  density  on  l  is  g(t^;  s+k,  i).  If  t  occurs  before 

L  there  is  a  backorder  and  total  backorder  cost  for  that  backorder  is: 

L  -at. 


Thus 


Now , 


R 


-at 


/  e  dt  =  B  [e 


k  -aL. 
-  e  J 


uk 

0  L 


B(Q)  = 

Bc  k=l  o 


/  g(t ; s+k, i)  (e  at  -  e  aL] 


L  s+k  L 

/  g(t ; s+k, i)  e  at  =  (ttt) 
o  1  o 


/  g(t;s+k,i+a) 


*  ^ 


s+k 


[1  -  F (s+k-1;  (i+a)L) ] 


where  F( s+k-1 ; (i+a)L)  is  the  probability  a  Poisson  random  variable  with  mean 
(L+a)L  is  s  i  k  -  1 ;  i.o.  t  <  \.  is  equivalent  to  s+k  demands  by  time 

L. 
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Putting  all  these  results  together, 


B(Q)  _ 


Q 

E 

k=l 


s+k 


-  e 


-cxL 


Q 

E 

k=l 


[1 


[1 


F(s+k-l ;  (i+a)L) ] 

F(s+k-l;i  L)] 


Application  of  Functional  Values. 

The  various  functional  values  H(  )  and  B(  )  are  used  to  derive 
the  L(s,s+Q,i)  and  then  the  (x,t)  as  discussed  in  the  section  of  the  report 
on  Starting  the  Recursion.  Then  the  recursive  equation  is  used. 
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APPENDIX  C 

ALGORITHM  PRECISION  AND  PROCESSING  TIMES 

Precision.  Ttu-  issues  investigated  briefly  were: 

(1)  In  the  formula  from  Section  2.2  for  tn'ax  ,  what  should 

n 

PSDF.V  be  multiplied  by? 

(2)  In  approximating  the  Gamma  prior  by  a  histogram,  how  many  points 
should  be  used  (Section  3.0)? 

(3)  How  much  accuracy  would  be  lost  if  we  limited  the  number  of  values 
of  the  sufficient  statistic  for  which  costs  were  calculated,  3nd  found  the 
rest  by  linear  interpolation? 

Four  runs  were  made  for  the  base  case  with  Y  =  1.0  and  BR  23  100 
(see  model  results  for  a  description  of  the  base  case).  The  first  and  second 

runs  multiplied  PSDEV  by  4  and  used  10  histogram  points,  but  the  second 

run  calculated  costs  for  a  maximum  of  30  values  of  the  sufficient  statistic 
in  any  given  period.  At  period  52,  there  were  (coincidentally).  52  values 

so  that  22  were  found  by  interpolation.  Run  3  differed  from  run  3.  in  that 
PSDEV  was  multiplied  by  5,  while  run  4  differed  from  run  1  in  that  only  5 
points  were  used  for  the  histogram  in  starting  the  recursion. 

Our  conclusions  were  that  Run  1  settings  were  adequate  and  that 
interpolation  might  be  practical.  We  recorded  for  each  run  the  (s,S)  values 
at  fielding  and  a  lead  time  before  fielding.  They  were  the  same  for  each 
run,  (15-20)  and  (0,12)  respectively.  At  fielding  we  recorded  the  expected 
lifetime  costs  (in  thousands)  if  assets  were  1  unit  and  you  ordered  up  to 
either  13.  14,  15,. ...21,  or  22  (Table  C.l).  A  lead  time  before  fielding 

we  recorded  expected  cost  if  assets  were  0  and  you  ordered  up  to  8,  9,  10. ..15, 
16  units.  We  focused  on  how  the  costs  changed  as  a  function  of  assets  bought, 
looking  at  asset  values  around  the  inventory  control  parameters  selected. 
Processing  Times 

The  algorithm  was  run  on  a  CDC  CYBER  76  computer  under  the  SCOPE 
2.1.5  operating  system.  For  the  base  case,  with  BR  =  100  and  Y  =  1.0,  17.5 
CPU  seconds  were  used.  For  Run  2,  using  interpolation,  CPU  seconds  dropped 
to  13.4  seconds.  Of  the  17.5  CPU  seconds  for  Run  1,  approximately  4.5  were 
used  to  get  through  the  "Starling  the  Recursion"  step. 
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TABLE  C.l.  COMPARISON  OF  EXPECTED  COSTS  (AT  FIELDING) 


RUN  ORDER  TO: 


13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

1 

$89.7 

$88.1 

$86.4 

$84.8 

$83.2 

$81.7 

$80.2 

$78.8 

$77.4 

$76.1 

2 

89.9 

88,3 

86.6 

84,9 

83,4 

81.9 

80.4 

79.0 

71.6 

76.3 

3 

90.3 

88.7 

87.0 

85.3 

83.8 

82.3 

80.8 

79.4 

78.0 

76.7 

4 

89.3 

87.7 

86.0 

84.4 

82.8 

81.3 

79.8 

78.4 

77.1 

75.7 

TABLE  C.2.  COMPARISON  OF  EXPECTED  COST  (AT  FIRST  BUY) 


RUN  ORDER  TO: 


8 

7 

10 

11 

12 

13 

14 

15 

16 

1 

$87,4 

$86.0 

$84.5 

$83.1 

$81.7 

$80.3 

$78.9 

$77.6 

$76.2 

2 

87.6 

86.2 

84  .X8 

83.3 

81.9 

80.5 

79.2 

77.8 

76.4 

3 

87.9 

86.5 

85.0 

83.6 

82.2 

80.8 

79.4 

79.0 

76.7 

4  Not  Recorded 
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APPENDIX  D 

PRE-FIELDING  HEURISTIC 


Suppose  we  wish  to  order  every  QT  (for  Q  Time)  weeks,  and  for  a 
moment  assume  we  order  whenever  assets  fall  below  lead  time  demand. 

Then,  we  draw  a  time  line  as  follows: 

|  QT  WEEKS _ | _ (L+l)  WEEKS _ j 

PRESENT  !  I 

If  we  order  up  to  our  expected  demand  in  the  next  (QT+L+1)  weeks,  and  demand 
occurs  as  expected,  we  will  order  next  in  QT-weeks.  If  ,:present"  is  pre-f  ielding , 
some  of  the  weeks  on  the  time  line  will  occur  before  fielding  and  we  add 
in  zeroes  for  those  weeks. 

Now  suppose  the  reorder  point  (the  s  value)  is  based  on  lead  time 
demand  plus  "n"  standard  deviations,  and  that  based  on  the  Poisson  assumption 
the  standard  deviation  is  the  square  root  of  the  mean.  Then  we  must  add 
to  our  order-up-to  level  "n"  multiplied  by  the  square  root  of  demand  in  the 
(L+l)  weeks  shown  on  the  t  m.e  line. 

The  heuristic  takes  as  input  the  (s,Q)  values  found  to  be  optimum 
at  time  0  (just  before  fielding)  for  the  cose  with  no  uncertainty.  From 
these  it.  computes  "n"  and  QT  and  applies  them  as  discussed  to  get  the  order-up-to 
level  at  any  time  prior  to  fielding. 

For  the  purpose  of  getting  the  reorder  point  (s),  a  different  time 
line  is  developed: 

_ (L-t-l)  WEEKS _ _ _ QT  UEEKS _ 

PRESENT 

Now  demand  in  the  first  (L+l)  weeks  is  used  to  get  s,  since  this  determines 
exposure  to  backorders  in  the  lead  time.  This  is  still  consistent  with  the 
derivation  of  the  order-up-to  leveL,  since  that  derivation  assumed  that  in 
QT  weeks  the  s  value  would  be  based  on  demand  in  the  next  (L+l)  weeks  counting 
from  that  point . 
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